Dataset statistics
| Number of variables | 14 |
|---|---|
| Number of observations | 2797 |
| Missing cells | 31 |
| Missing cells (%) | 0.1% |
| Duplicate rows | 2 |
| Duplicate rows (%) | 0.1% |
| Total size in memory | 594.8 KiB |
| Average record size in memory | 217.8 B |
Variable types
| Numeric | 9 |
|---|---|
| Categorical | 4 |
| Text | 1 |
| Dataset has 2 (0.1%) duplicate rows | Duplicates |
Waterfront is highly imbalanced (94.9%) | Imbalance |
View is highly imbalanced (73.5%) | Imbalance |
Condition has 31 (1.1%) missing values | Missing |
sqftBase has 1650 (59.0%) zeros | Zeros |
YrRenov has 1616 (57.8%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-16 16:35:04.142090 |
|---|---|
| Analysis finished | 2024-05-16 16:35:22.174411 |
| Duration | 18.03 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
Bedrooms
Real number (ℝ)
| Distinct | 8 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3875581 |
| Minimum | 1 |
|---|---|
| Maximum | 8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 8 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.90458359 |
|---|---|
| Coefficient of variation (CV) | 0.26703117 |
| Kurtosis | 1.0264785 |
| Mean | 3.3875581 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.47633669 |
| Sum | 9475 |
| Variance | 0.81827147 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3 | 1266 | |
| 4 | 905 | |
| 2 | 345 | 12.3% |
| 5 | 205 | 7.3% |
| 6 | 41 | 1.5% |
| 1 | 25 | 0.9% |
| 7 | 9 | 0.3% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 25 | 0.9% |
| 2 | 345 | 12.3% |
| 3 | 1266 | |
| 4 | 905 | |
| 5 | 205 | 7.3% |
| 6 | 41 | 1.5% |
| 7 | 9 | 0.3% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 7 | 9 | 0.3% |
| 6 | 41 | 1.5% |
| 5 | 205 | 7.3% |
| 4 | 905 | |
| 3 | 1266 | |
| 2 | 345 | 12.3% |
| 1 | 25 | 0.9% |
Bathrooms
Real number (ℝ)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1197712 |
| Minimum | 0.75 |
|---|---|
| Maximum | 5.75 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 0.75 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1.75 |
| median | 2.25 |
| Q3 | 2.5 |
| 95-th percentile | 3.5 |
| Maximum | 5.75 |
| Range | 5 |
| Interquartile range (IQR) | 0.75 |
Descriptive statistics
| Standard deviation | 0.74812036 |
|---|---|
| Coefficient of variation (CV) | 0.35292505 |
| Kurtosis | 0.64591642 |
| Mean | 2.1197712 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.39518554 |
| Sum | 5929 |
| Variance | 0.55968407 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.5 | 732 | |
| 1 | 477 | |
| 1.75 | 384 | |
| 2 | 270 | 9.7% |
| 2.25 | 259 | 9.3% |
| 1.5 | 188 | 6.7% |
| 2.75 | 149 | 5.3% |
| 3 | 105 | 3.8% |
| 3.25 | 76 | 2.7% |
| 3.5 | 75 | 2.7% |
| Other values (11) | 82 | 2.9% |
| Value | Count | Frequency (%) |
| 0.75 | 9 | 0.3% |
| 1 | 477 | |
| 1.25 | 1 | < 0.1% |
| 1.5 | 188 | 6.7% |
| 1.75 | 384 | |
| 2 | 270 | 9.7% |
| 2.25 | 259 | 9.3% |
| 2.5 | 732 | |
| 2.75 | 149 | 5.3% |
| 3 | 105 | 3.8% |
| Value | Count | Frequency (%) |
| 5.75 | 1 | < 0.1% |
| 5.5 | 2 | 0.1% |
| 5.25 | 1 | < 0.1% |
| 5 | 3 | 0.1% |
| 4.75 | 3 | 0.1% |
| 4.5 | 12 | 0.4% |
| 4.25 | 12 | 0.4% |
| 4 | 17 | 0.6% |
| 3.75 | 21 | 0.8% |
| 3.5 | 75 |
sqftLiving
Real number (ℝ)
| Distinct | 454 |
|---|---|
| Distinct (%) | 16.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2070.3843 |
| Minimum | 370 |
|---|---|
| Maximum | 6900 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 370 |
|---|---|
| 5-th percentile | 950 |
| Q1 | 1450 |
| median | 1940 |
| Q3 | 2540 |
| 95-th percentile | 3672 |
| Maximum | 6900 |
| Range | 6530 |
| Interquartile range (IQR) | 1090 |
Descriptive statistics
| Standard deviation | 860.19216 |
|---|---|
| Coefficient of variation (CV) | 0.41547463 |
| Kurtosis | 2.0459365 |
| Mean | 2070.3843 |
| Median Absolute Deviation (MAD) | 540 |
| Skewness | 1.0951491 |
| Sum | 5790865 |
| Variance | 739930.55 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1680 | 22 | 0.8% |
| 1660 | 21 | 0.8% |
| 1940 | 21 | 0.8% |
| 1300 | 21 | 0.8% |
| 1200 | 20 | 0.7% |
| 1800 | 20 | 0.7% |
| 1500 | 19 | 0.7% |
| 1480 | 19 | 0.7% |
| 1840 | 19 | 0.7% |
| 1230 | 19 | 0.7% |
| Other values (444) | 2596 |
| Value | Count | Frequency (%) |
| 370 | 1 | |
| 420 | 1 | |
| 490 | 1 | |
| 520 | 1 | |
| 550 | 1 | |
| 560 | 1 | |
| 580 | 1 | |
| 590 | 2 | |
| 620 | 1 | |
| 630 | 1 |
| Value | Count | Frequency (%) |
| 6900 | 1 | |
| 6630 | 1 | |
| 6490 | 1 | |
| 6070 | 1 | |
| 6050 | 1 | |
| 6040 | 1 | |
| 5960 | 1 | |
| 5850 | 1 | |
| 5584 | 1 | |
| 5520 | 1 |
sqftLot
Real number (ℝ)
| Distinct | 2033 |
|---|---|
| Distinct (%) | 72.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15216.336 |
| Minimum | 747 |
|---|---|
| Maximum | 1074218 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 747 |
|---|---|
| 5-th percentile | 1676.8 |
| Q1 | 5000 |
| median | 7688 |
| Q3 | 11000 |
| 95-th percentile | 46405.8 |
| Maximum | 1074218 |
| Range | 1073471 |
| Interquartile range (IQR) | 6000 |
Descriptive statistics
| Standard deviation | 37804.856 |
|---|---|
| Coefficient of variation (CV) | 2.4844913 |
| Kurtosis | 252.30781 |
| Mean | 15216.336 |
| Median Absolute Deviation (MAD) | 2743 |
| Skewness | 12.157446 |
| Sum | 42560093 |
| Variance | 1.4292071 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5000 | 51 | 1.8% |
| 6000 | 47 | 1.7% |
| 7200 | 34 | 1.2% |
| 4000 | 27 | 1.0% |
| 8100 | 17 | 0.6% |
| 7500 | 17 | 0.6% |
| 4500 | 16 | 0.6% |
| 4800 | 16 | 0.6% |
| 3000 | 15 | 0.5% |
| 8400 | 13 | 0.5% |
| Other values (2023) | 2544 |
| Value | Count | Frequency (%) |
| 747 | 1 | |
| 750 | 1 | |
| 779 | 1 | |
| 833 | 1 | |
| 835 | 1 | |
| 844 | 1 | |
| 867 | 1 | |
| 868 | 1 | |
| 885 | 1 | |
| 889 | 1 |
| Value | Count | Frequency (%) |
| 1074218 | 1 | |
| 478288 | 1 | |
| 435600 | 1 | |
| 423838 | 1 | |
| 389126 | 1 | |
| 327135 | 1 | |
| 251341 | 1 | |
| 250470 | 1 | |
| 249126 | 1 | |
| 247421 | 1 |
Floors
Real number (ℝ)
| Distinct | 6 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4944583 |
| Minimum | 1 |
|---|---|
| Maximum | 3.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1.5 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 3.5 |
| Range | 2.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.53679495 |
|---|---|
| Coefficient of variation (CV) | 0.35919031 |
| Kurtosis | -0.45752724 |
| Mean | 1.4944583 |
| Median Absolute Deviation (MAD) | 0.5 |
| Skewness | 0.61901879 |
| Sum | 4180 |
| Variance | 0.28814882 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1365 | |
| 2 | 1052 | |
| 1.5 | 279 | 10.0% |
| 3 | 78 | 2.8% |
| 2.5 | 22 | 0.8% |
| 3.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1 | 1365 | |
| 1.5 | 279 | 10.0% |
| 2 | 1052 | |
| 2.5 | 22 | 0.8% |
| 3 | 78 | 2.8% |
| 3.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 3.5 | 1 | < 0.1% |
| 3 | 78 | 2.8% |
| 2.5 | 22 | 0.8% |
| 2 | 1052 | |
| 1.5 | 279 | 10.0% |
| 1 | 1365 |
Waterfront
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 158.4 KiB |
| 0 | |
|---|---|
| 1 | 16 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2797 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2781 | |
| 1 | 16 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2781 | |
| 1 | 16 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2781 | |
| 1 | 16 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2797 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2781 | |
| 1 | 16 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2797 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2781 | |
| 1 | 16 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2797 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2781 | |
| 1 | 16 | 0.6% |
View
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 158.4 KiB |
| 0 | |
|---|---|
| 2 | 130 |
| 3 | 68 |
| 4 | 36 |
| 1 | 29 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2797 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 3 |
|---|---|
| 2nd row | 2 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2534 | |
| 2 | 130 | 4.6% |
| 3 | 68 | 2.4% |
| 4 | 36 | 1.3% |
| 1 | 29 | 1.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2534 | |
| 2 | 130 | 4.6% |
| 3 | 68 | 2.4% |
| 4 | 36 | 1.3% |
| 1 | 29 | 1.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2534 | |
| 2 | 130 | 4.6% |
| 3 | 68 | 2.4% |
| 4 | 36 | 1.3% |
| 1 | 29 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2797 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2534 | |
| 2 | 130 | 4.6% |
| 3 | 68 | 2.4% |
| 4 | 36 | 1.3% |
| 1 | 29 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2797 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2534 | |
| 2 | 130 | 4.6% |
| 3 | 68 | 2.4% |
| 4 | 36 | 1.3% |
| 1 | 29 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2797 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2534 | |
| 2 | 130 | 4.6% |
| 3 | 68 | 2.4% |
| 4 | 36 | 1.3% |
| 1 | 29 | 1.0% |
Condition
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 31 |
| Missing (%) | 1.1% |
| Memory size | 164.0 KiB |
| 3.0 | |
|---|---|
| 4.0 | |
| 5.0 | |
| 2.0 | 17 |
| 1.0 | 4 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 8298 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 2.0 |
| 4th row | 3.0 |
| 5th row | 4.0 |
Common Values
| Value | Count | Frequency (%) |
| 3.0 | 1743 | |
| 4.0 | 749 | |
| 5.0 | 253 | 9.0% |
| 2.0 | 17 | 0.6% |
| 1.0 | 4 | 0.1% |
| (Missing) | 31 | 1.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 3.0 | 1743 | |
| 4.0 | 749 | |
| 5.0 | 253 | 9.1% |
| 2.0 | 17 | 0.6% |
| 1.0 | 4 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 2766 | |
| 0 | 2766 | |
| 3 | 1743 | |
| 4 | 749 | 9.0% |
| 5 | 253 | 3.0% |
| 2 | 17 | 0.2% |
| 1 | 4 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5532 | |
| Other Punctuation | 2766 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2766 | |
| 3 | 1743 | |
| 4 | 749 | 13.5% |
| 5 | 253 | 4.6% |
| 2 | 17 | 0.3% |
| 1 | 4 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2766 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8298 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 2766 | |
| 0 | 2766 | |
| 3 | 1743 | |
| 4 | 749 | 9.0% |
| 5 | 253 | 3.0% |
| 2 | 17 | 0.2% |
| 1 | 4 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8298 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 2766 | |
| 0 | 2766 | |
| 3 | 1743 | |
| 4 | 749 | 9.0% |
| 5 | 253 | 3.0% |
| 2 | 17 | 0.2% |
| 1 | 4 | < 0.1% |
sqftAbove
Real number (ℝ)
| Distinct | 412 |
|---|---|
| Distinct (%) | 14.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1762.0915 |
| Minimum | 370 |
|---|---|
| Maximum | 6070 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 370 |
|---|---|
| 5-th percentile | 860 |
| Q1 | 1170 |
| median | 1540 |
| Q3 | 2200 |
| 95-th percentile | 3302 |
| Maximum | 6070 |
| Range | 5700 |
| Interquartile range (IQR) | 1030 |
Descriptive statistics
| Standard deviation | 790.32153 |
|---|---|
| Coefficient of variation (CV) | 0.44851333 |
| Kurtosis | 1.6727722 |
| Mean | 1762.0915 |
| Median Absolute Deviation (MAD) | 450 |
| Skewness | 1.21721 |
| Sum | 4928570 |
| Variance | 624608.13 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1140 | 33 | 1.2% |
| 1200 | 31 | 1.1% |
| 1300 | 31 | 1.1% |
| 1320 | 30 | 1.1% |
| 1010 | 30 | 1.1% |
| 1050 | 28 | 1.0% |
| 1400 | 27 | 1.0% |
| 1680 | 24 | 0.9% |
| 1240 | 24 | 0.9% |
| 1090 | 24 | 0.9% |
| Other values (402) | 2515 |
| Value | Count | Frequency (%) |
| 370 | 1 | < 0.1% |
| 420 | 1 | < 0.1% |
| 490 | 1 | < 0.1% |
| 520 | 1 | < 0.1% |
| 550 | 3 | |
| 560 | 1 | < 0.1% |
| 580 | 1 | < 0.1% |
| 590 | 2 | |
| 620 | 1 | < 0.1% |
| 630 | 3 |
| Value | Count | Frequency (%) |
| 6070 | 1 | |
| 6050 | 1 | |
| 5584 | 1 | |
| 5190 | 1 | |
| 5070 | 1 | |
| 4930 | 2 | |
| 4850 | 1 | |
| 4820 | 1 | |
| 4770 | 1 | |
| 4740 | 1 |
sqftBase
Real number (ℝ)
ZEROS 
| Distinct | 175 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 308.29281 |
| Minimum | 0 |
|---|---|
| Maximum | 2550 |
| Zeros | 1650 |
| Zeros (%) | 59.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 610 |
| 95-th percentile | 1180 |
| Maximum | 2550 |
| Range | 2550 |
| Interquartile range (IQR) | 610 |
Descriptive statistics
| Standard deviation | 442.60477 |
|---|---|
| Coefficient of variation (CV) | 1.4356636 |
| Kurtosis | 0.88265297 |
| Mean | 308.29281 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2895978 |
| Sum | 862295 |
| Variance | 195898.99 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1650 | |
| 500 | 33 | 1.2% |
| 800 | 32 | 1.1% |
| 600 | 28 | 1.0% |
| 700 | 24 | 0.9% |
| 400 | 23 | 0.8% |
| 1000 | 21 | 0.8% |
| 900 | 20 | 0.7% |
| 300 | 17 | 0.6% |
| 480 | 16 | 0.6% |
| Other values (165) | 933 |
| Value | Count | Frequency (%) |
| 0 | 1650 | |
| 20 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 60 | 2 | 0.1% |
| 65 | 1 | < 0.1% |
| 80 | 1 | < 0.1% |
| 90 | 2 | 0.1% |
| 100 | 7 | 0.3% |
| 110 | 1 | < 0.1% |
| 120 | 8 | 0.3% |
| Value | Count | Frequency (%) |
| 2550 | 1 | |
| 2300 | 1 | |
| 2150 | 2 | |
| 2080 | 1 | |
| 2070 | 1 | |
| 1950 | 2 | |
| 1940 | 1 | |
| 1910 | 1 | |
| 1860 | 1 | |
| 1840 | 1 |
Yearbuilt
Real number (ℝ)
| Distinct | 115 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1969.7762 |
| Minimum | 1900 |
|---|---|
| Maximum | 2014 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 1900 |
|---|---|
| 5-th percentile | 1913 |
| Q1 | 1950 |
| median | 1973 |
| Q3 | 1995 |
| 95-th percentile | 2008 |
| Maximum | 2014 |
| Range | 114 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 29.274488 |
|---|---|
| Coefficient of variation (CV) | 0.014861835 |
| Kurtosis | -0.67477661 |
| Mean | 1969.7762 |
| Median Absolute Deviation (MAD) | 23 |
| Skewness | -0.47264833 |
| Sum | 5509464 |
| Variance | 856.99567 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2006 | 65 | 2.3% |
| 2005 | 62 | 2.2% |
| 2003 | 57 | 2.0% |
| 2007 | 52 | 1.9% |
| 2004 | 51 | 1.8% |
| 2008 | 51 | 1.8% |
| 1978 | 51 | 1.8% |
| 1987 | 49 | 1.8% |
| 1967 | 47 | 1.7% |
| 1968 | 47 | 1.7% |
| Other values (105) | 2265 |
| Value | Count | Frequency (%) |
| 1900 | 19 | |
| 1901 | 2 | 0.1% |
| 1902 | 5 | 0.2% |
| 1903 | 6 | 0.2% |
| 1904 | 7 | 0.3% |
| 1905 | 8 | |
| 1906 | 14 | |
| 1907 | 7 | 0.3% |
| 1908 | 13 | |
| 1909 | 13 |
| Value | Count | Frequency (%) |
| 2014 | 21 | 0.8% |
| 2013 | 26 | 0.9% |
| 2012 | 17 | 0.6% |
| 2011 | 17 | 0.6% |
| 2010 | 14 | 0.5% |
| 2009 | 31 | |
| 2008 | 51 | |
| 2007 | 52 | |
| 2006 | 65 | |
| 2005 | 62 |
YrRenov
Real number (ℝ)
ZEROS 
| Distinct | 54 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 842.63675 |
| Minimum | 0 |
|---|---|
| Maximum | 2014 |
| Zeros | 1616 |
| Zeros (%) | 57.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 43.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2000 |
| 95-th percentile | 2011 |
| Maximum | 2014 |
| Range | 2014 |
| Interquartile range (IQR) | 2000 |
Descriptive statistics
| Standard deviation | 985.94059 |
|---|---|
| Coefficient of variation (CV) | 1.170066 |
| Kurtosis | -1.9011198 |
| Mean | 842.63675 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 0.31555658 |
| Sum | 2356855 |
| Variance | 972078.84 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1616 | |
| 2000 | 108 | 3.9% |
| 2003 | 86 | 3.1% |
| 2001 | 78 | 2.8% |
| 2005 | 72 | 2.6% |
| 2009 | 65 | 2.3% |
| 2013 | 50 | 1.8% |
| 2014 | 49 | 1.8% |
| 2006 | 47 | 1.7% |
| 2004 | 47 | 1.7% |
| Other values (44) | 579 | 20.7% |
| Value | Count | Frequency (%) |
| 0 | 1616 | |
| 1912 | 17 | 0.6% |
| 1913 | 1 | < 0.1% |
| 1923 | 26 | 0.9% |
| 1934 | 4 | 0.1% |
| 1945 | 2 | 0.1% |
| 1948 | 1 | < 0.1% |
| 1953 | 1 | < 0.1% |
| 1954 | 3 | 0.1% |
| 1955 | 2 | 0.1% |
| Value | Count | Frequency (%) |
| 2014 | 49 | |
| 2013 | 50 | |
| 2012 | 29 | |
| 2011 | 36 | |
| 2010 | 14 | 0.5% |
| 2009 | 65 | |
| 2008 | 31 | |
| 2007 | 5 | 0.2% |
| 2006 | 47 | |
| 2005 | 72 |
City
Categorical
| Distinct | 43 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 176.9 KiB |
| Seattle | |
|---|---|
| Renton | |
| Bellevue | |
| Redmond | |
| Kirkland | 119 |
| Other values (38) |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 7.7504469 |
| Min length | 4 |
Characters and Unicode
| Total characters | 21678 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Seattle |
|---|---|
| 2nd row | Seattle |
| 3rd row | Redmond |
| 4th row | Sammamish |
| 5th row | Redmond |
Common Values
| Value | Count | Frequency (%) |
| Seattle | 993 | |
| Renton | 168 | 6.0% |
| Bellevue | 159 | 5.7% |
| Redmond | 142 | 5.1% |
| Kirkland | 119 | 4.3% |
| Kent | 110 | 3.9% |
| Sammamish | 108 | 3.9% |
| Issaquah | 107 | 3.8% |
| Auburn | 100 | 3.6% |
| Shoreline | 85 | 3.0% |
| Other values (33) | 706 |
Length
| Value | Count | Frequency (%) |
| seattle | 993 | |
| renton | 168 | 5.4% |
| bellevue | 159 | 5.1% |
| redmond | 142 | 4.5% |
| kirkland | 119 | 3.8% |
| kent | 110 | 3.5% |
| sammamish | 108 | 3.4% |
| issaquah | 107 | 3.4% |
| auburn | 100 | 3.2% |
| shoreline | 85 | 2.7% |
| Other values (46) | 1040 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 3900 | |
| t | 2415 | |
| l | 2210 | |
| a | 2183 | 10.1% |
| n | 1354 | 6.2% |
| S | 1248 | 5.8% |
| o | 853 | 3.9% |
| r | 685 | 3.2% |
| i | 675 | 3.1% |
| d | 661 | 3.0% |
| Other values (35) | 5494 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18193 | |
| Uppercase Letter | 3150 | 14.5% |
| Space Separator | 334 | 1.5% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3900 | |
| t | 2415 | |
| l | 2210 | |
| a | 2183 | |
| n | 1354 | 7.4% |
| o | 853 | 4.7% |
| r | 685 | 3.8% |
| i | 675 | 3.7% |
| d | 661 | 3.6% |
| u | 618 | 3.4% |
| Other values (14) | 2639 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1248 | |
| R | 315 | 10.0% |
| B | 273 | 8.7% |
| K | 270 | 8.6% |
| I | 151 | 4.8% |
| W | 150 | 4.8% |
| M | 136 | 4.3% |
| F | 113 | 3.6% |
| A | 104 | 3.3% |
| V | 78 | 2.5% |
| Other values (9) | 312 | 9.9% |
Space Separator
| Value | Count | Frequency (%) |
| 334 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21343 | |
| Common | 335 | 1.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 3900 | |
| t | 2415 | |
| l | 2210 | |
| a | 2183 | |
| n | 1354 | 6.3% |
| S | 1248 | 5.8% |
| o | 853 | 4.0% |
| r | 685 | 3.2% |
| i | 675 | 3.2% |
| d | 661 | 3.1% |
| Other values (33) | 5159 |
Common
| Value | Count | Frequency (%) |
| 334 | ||
| - | 1 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 21678 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 3900 | |
| t | 2415 | |
| l | 2210 | |
| a | 2183 | 10.1% |
| n | 1354 | 6.2% |
| S | 1248 | 5.8% |
| o | 853 | 3.9% |
| r | 685 | 3.2% |
| i | 675 | 3.1% |
| d | 661 | 3.0% |
| Other values (35) | 5494 |
StateZip
Text
| Distinct | 76 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 177.5 KiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 22376 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | WA 98144 |
|---|---|
| 2nd row | WA 98126 |
| 3rd row | WA 98053 |
| 4th row | WA 98074 |
| 5th row | WA 98052 |
| Value | Count | Frequency (%) |
| wa | 2797 | |
| 98103 | 89 | 1.6% |
| 98052 | 85 | 1.5% |
| 98115 | 83 | 1.5% |
| 98117 | 82 | 1.5% |
| 98133 | 68 | 1.2% |
| 98034 | 66 | 1.2% |
| 98125 | 64 | 1.1% |
| 98074 | 62 | 1.1% |
| 98155 | 61 | 1.1% |
| Other values (67) | 2137 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 3228 | |
| 9 | 3127 | |
| W | 2797 | |
| A | 2797 | |
| 2797 | ||
| 0 | 2163 | |
| 1 | 1741 | |
| 5 | 783 | 3.5% |
| 2 | 737 | 3.3% |
| 3 | 703 | 3.1% |
| Other values (3) | 1503 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13985 | |
| Uppercase Letter | 5594 | 25.0% |
| Space Separator | 2797 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3228 | |
| 9 | 3127 | |
| 0 | 2163 | |
| 1 | 1741 | |
| 5 | 783 | 5.6% |
| 2 | 737 | 5.3% |
| 3 | 703 | 5.0% |
| 7 | 532 | 3.8% |
| 6 | 496 | 3.5% |
| 4 | 475 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 2797 | |
| A | 2797 |
Space Separator
| Value | Count | Frequency (%) |
| 2797 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 16782 | |
| Latin | 5594 | 25.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 3228 | |
| 9 | 3127 | |
| 2797 | ||
| 0 | 2163 | |
| 1 | 1741 | |
| 5 | 783 | 4.7% |
| 2 | 737 | 4.4% |
| 3 | 703 | 4.2% |
| 7 | 532 | 3.2% |
| 6 | 496 | 3.0% |
Latin
| Value | Count | Frequency (%) |
| W | 2797 | |
| A | 2797 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22376 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 3228 | |
| 9 | 3127 | |
| W | 2797 | |
| A | 2797 | |
| 2797 | ||
| 0 | 2163 | |
| 1 | 1741 | |
| 5 | 783 | 3.5% |
| 2 | 737 | 3.3% |
| 3 | 703 | 3.1% |
| Other values (3) | 1503 |
| Bedrooms | Bathrooms | sqftLiving | sqftLot | Floors | Waterfront | View | Condition | sqftAbove | sqftBase | Yearbuilt | YrRenov | City | StateZip | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4398 | 5 | 3.75 | 5340 | 10655 | 2.5 | 0 | 3 | 4.0 | 3740 | 1600 | 1912 | 1989 | Seattle | WA 98144 |
| 3495 | 4 | 3.50 | 3370 | 5000 | 2.0 | 0 | 2 | 3.0 | 2470 | 900 | 2008 | 0 | Seattle | WA 98126 |
| 3668 | 2 | 1.00 | 1200 | 24792 | 2.0 | 0 | 0 | 2.0 | 1200 | 0 | 1976 | 0 | Redmond | WA 98053 |
| 3308 | 3 | 2.50 | 1790 | 8144 | 2.0 | 0 | 0 | 3.0 | 1790 | 0 | 1989 | 0 | Sammamish | WA 98074 |
| 1969 | 4 | 2.75 | 2020 | 10720 | 1.0 | 0 | 0 | 4.0 | 1420 | 600 | 1976 | 1992 | Redmond | WA 98052 |
| 1889 | 4 | 2.00 | 2120 | 8701 | 1.5 | 0 | 0 | 4.0 | 2120 | 0 | 1960 | 2001 | Des Moines | WA 98198 |
| 1111 | 4 | 2.50 | 2630 | 48706 | 2.0 | 0 | 0 | 3.0 | 2630 | 0 | 1986 | 0 | Woodinville | WA 98072 |
| 3315 | 2 | 1.75 | 1670 | 4008 | 1.0 | 0 | 0 | 3.0 | 1670 | 0 | 2005 | 0 | Issaquah | WA 98029 |
| 1282 | 4 | 1.75 | 1814 | 5000 | 1.0 | 0 | 0 | 4.0 | 944 | 870 | 1951 | 1999 | Seattle | WA 98115 |
| 2653 | 2 | 1.00 | 740 | 6180 | 1.0 | 0 | 0 | 3.0 | 740 | 0 | 1948 | 1994 | Seattle | WA 98118 |
| Bedrooms | Bathrooms | sqftLiving | sqftLot | Floors | Waterfront | View | Condition | sqftAbove | sqftBase | Yearbuilt | YrRenov | City | StateZip | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4340 | 4 | 2.50 | 2700 | 8810 | 2.0 | 0 | 0 | 3.0 | 2700 | 0 | 2004 | 2003 | Redmond | WA 98052 |
| 361 | 4 | 2.75 | 2280 | 2850 | 1.5 | 0 | 0 | 4.0 | 1540 | 740 | 1930 | 0 | Seattle | WA 98115 |
| 1043 | 2 | 2.50 | 1590 | 2656 | 2.0 | 0 | 0 | 3.0 | 1220 | 370 | 2009 | 0 | Seattle | WA 98106 |
| 3131 | 4 | 2.50 | 1840 | 5550 | 2.0 | 0 | 0 | 3.0 | 1840 | 0 | 2004 | 2003 | Kent | WA 98031 |
| 2275 | 4 | 2.00 | 1680 | 5000 | 1.0 | 0 | 0 | 3.0 | 980 | 700 | 1950 | 2005 | Seattle | WA 98115 |
| 3421 | 4 | 1.50 | 1770 | 5750 | 2.0 | 0 | 0 | 3.0 | 1770 | 0 | 1947 | 2012 | Seattle | WA 98116 |
| 809 | 3 | 2.00 | 1900 | 6660 | 1.0 | 0 | 0 | 5.0 | 950 | 950 | 1966 | 0 | Maple Valley | WA 98038 |
| 3803 | 4 | 2.50 | 2303 | 3826 | 2.0 | 0 | 0 | 3.0 | 2303 | 0 | 2006 | 0 | Auburn | WA 98092 |
| 3158 | 3 | 2.50 | 2540 | 4775 | 2.0 | 0 | 0 | 3.0 | 2540 | 0 | 2006 | 0 | Kent | WA 98042 |
| 451 | 3 | 2.00 | 1690 | 9583 | 1.0 | 0 | 0 | 4.0 | 1690 | 0 | 1969 | 0 | Renton | WA 98059 |
Most frequently occurring
| Bedrooms | Bathrooms | sqftLiving | sqftLot | Floors | Waterfront | View | Condition | sqftAbove | sqftBase | Yearbuilt | YrRenov | City | StateZip | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3 | 2.5 | 1560 | 4200 | 2.0 | 0 | 0 | 3.0 | 1560 | 0 | 2003 | 0 | Maple Valley | WA 98038 | 2 |
| 1 | 3 | 2.5 | 1800 | 2700 | 2.0 | 0 | 0 | 3.0 | 1800 | 0 | 2011 | 0 | Seattle | WA 98126 | 2 |